Dataset statistics
| Number of variables | 17 |
|---|---|
| Number of observations | 539127 |
| Missing cells | 124695 |
| Missing cells (%) | 1.4% |
| Duplicate rows | 3826 |
| Duplicate rows (%) | 0.7% |
| Total size in memory | 69.9 MiB |
| Average record size in memory | 136.0 B |
Variable types
| Numeric | 10 |
|---|---|
| Categorical | 6 |
| Unsupported | 1 |
| Dataset has 3826 (0.7%) duplicate rows | Duplicates |
State has a high cardinality: 55 distinct values | High cardinality |
County has a high cardinality: 1808 distinct values | High cardinality |
Crop has a high cardinality: 283 distinct values | High cardinality |
State_Code is highly correlated with State_County_Code | High correlation |
State_County_Code is highly correlated with State_Code | High correlation |
Planted_Acres is highly correlated with Planted_and_Failed_Acres | High correlation |
Planted_and_Failed_Acres is highly correlated with Planted_Acres | High correlation |
State_Code is highly correlated with State_County_Code | High correlation |
State_County_Code is highly correlated with State_Code | High correlation |
Planted_Acres is highly correlated with Planted_and_Failed_Acres | High correlation |
Planted_and_Failed_Acres is highly correlated with Planted_Acres | High correlation |
State_Code is highly correlated with State_County_Code | High correlation |
State_County_Code is highly correlated with State_Code | High correlation |
Planted_Acres is highly correlated with Planted_and_Failed_Acres | High correlation |
Planted_and_Failed_Acres is highly correlated with Planted_Acres | High correlation |
Intended_Use is highly correlated with Irrigation_Practice | High correlation |
Irrigation_Practice is highly correlated with Intended_Use | High correlation |
State_Code is highly correlated with State and 1 other fields | High correlation |
County_Code is highly correlated with State | High correlation |
Crop_Code is highly correlated with Intended_Use and 1 other fields | High correlation |
State is highly correlated with State_Code and 3 other fields | High correlation |
State_County_Code is highly correlated with State_Code and 1 other fields | High correlation |
Intended_Use is highly correlated with Crop_Code and 1 other fields | High correlation |
Irrigation_Practice is highly correlated with Crop_Code and 2 other fields | High correlation |
Planted_Acres is highly correlated with Planted_and_Failed_Acres | High correlation |
Planted_and_Failed_Acres is highly correlated with Planted_Acres | High correlation |
Crop_Type has 98932 (18.4%) missing values | Missing |
Intended_Use has 25340 (4.7%) missing values | Missing |
Planted_Acres is highly skewed (γ1 = 54.10435986) | Skewed |
Volunteer_Acres is highly skewed (γ1 = 210.2360612) | Skewed |
Failed_Acres is highly skewed (γ1 = 138.0075859) | Skewed |
Prevented_Acres is highly skewed (γ1 = 72.64528079) | Skewed |
Not_Planted_Acres is highly skewed (γ1 = 131.4743886) | Skewed |
Planted_and_Failed_Acres is highly skewed (γ1 = 54.01257506) | Skewed |
Crop_Type is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Planted_Acres has 17658 (3.3%) zeros | Zeros |
Volunteer_Acres has 510568 (94.7%) zeros | Zeros |
Failed_Acres has 529907 (98.3%) zeros | Zeros |
Prevented_Acres has 521451 (96.7%) zeros | Zeros |
Not_Planted_Acres has 522162 (96.9%) zeros | Zeros |
Planted_and_Failed_Acres has 17116 (3.2%) zeros | Zeros |
Reproduction
| Analysis started | 2022-05-23 15:10:30.753196 |
|---|---|
| Analysis finished | 2022-05-23 15:11:55.214873 |
| Duration | 1 minute and 24.46 seconds |
| Software version | pandas-profiling v3.2.0 |
| Download configuration | config.json |
| Distinct | 55 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 31.05325832 |
| Minimum | 1 |
|---|---|
| Maximum | 72 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 8 |
| Q1 | 19 |
| median | 30 |
| Q3 | 42 |
| 95-th percentile | 54 |
| Maximum | 72 |
| Range | 71 |
| Interquartile range (IQR) | 23 |
Descriptive statistics
| Standard deviation | 14.08137574 |
|---|---|
| Coefficient of variation (CV) | 0.4534588799 |
| Kurtosis | -0.7979441515 |
| Mean | 31.05325832 |
| Median Absolute Deviation (MAD) | 11 |
| Skewness | -0.01330404755 |
| Sum | 16741650 |
| Variance | 198.2851426 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 48 | 35657 | 6.6% |
| 27 | 26281 | 4.9% |
| 31 | 26132 | 4.8% |
| 19 | 25414 | 4.7% |
| 20 | 22637 | 4.2% |
| 55 | 22074 | 4.1% |
| 17 | 19790 | 3.7% |
| 39 | 18846 | 3.5% |
| 26 | 18214 | 3.4% |
| 29 | 18123 | 3.4% |
| Other values (45) | 305959 |
| Value | Count | Frequency (%) |
| 1 | 8923 | |
| 2 | 301 | 0.1% |
| 4 | 1488 | 0.3% |
| 5 | 6054 | 1.1% |
| 6 | 9513 | |
| 8 | 9760 | |
| 9 | 2172 | 0.4% |
| 10 | 1056 | 0.2% |
| 12 | 6537 | 1.2% |
| 13 | 16963 |
| Value | Count | Frequency (%) |
| 72 | 1320 | 0.2% |
| 69 | 33 | < 0.1% |
| 60 | 15 | < 0.1% |
| 56 | 3397 | 0.6% |
| 55 | 22074 | |
| 54 | 3969 | 0.7% |
| 53 | 7170 | 1.3% |
| 52 | 331 | 0.1% |
| 51 | 10874 | |
| 50 | 1700 | 0.3% |
| Distinct | 272 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 90.68761906 |
| Minimum | 1 |
|---|---|
| Maximum | 810 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 33 |
| median | 75 |
| Q3 | 125 |
| 95-th percentile | 217 |
| Maximum | 810 |
| Range | 809 |
| Interquartile range (IQR) | 92 |
Descriptive statistics
| Standard deviation | 81.23338367 |
|---|---|
| Coefficient of variation (CV) | 0.8957494365 |
| Kurtosis | 9.532978197 |
| Mean | 90.68761906 |
| Median Absolute Deviation (MAD) | 46 |
| Skewness | 2.319417732 |
| Sum | 48892144 |
| Variance | 6598.862622 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5 | 9962 | 1.8% |
| 11 | 9838 | 1.8% |
| 3 | 9241 | 1.7% |
| 21 | 9236 | 1.7% |
| 1 | 8843 | 1.6% |
| 27 | 8721 | 1.6% |
| 25 | 8571 | 1.6% |
| 19 | 8457 | 1.6% |
| 13 | 8166 | 1.5% |
| 15 | 8099 | 1.5% |
| Other values (262) | 449993 |
| Value | Count | Frequency (%) |
| 1 | 8843 | |
| 2 | 388 | 0.1% |
| 3 | 9241 | |
| 4 | 149 | < 0.1% |
| 5 | 9962 | |
| 6 | 19 | < 0.1% |
| 7 | 6989 | |
| 9 | 7633 | |
| 11 | 9838 | |
| 12 | 90 | < 0.1% |
| Value | Count | Frequency (%) |
| 810 | 148 | |
| 800 | 257 | |
| 550 | 75 | < 0.1% |
| 510 | 3 | < 0.1% |
| 507 | 219 | |
| 505 | 30 | < 0.1% |
| 503 | 161 | |
| 501 | 361 | |
| 499 | 37 | < 0.1% |
| 497 | 140 | < 0.1% |
| Distinct | 281 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 405.2536026 |
| Minimum | 1 |
|---|---|
| Maximum | 9999 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 16 |
| Q1 | 67 |
| median | 102 |
| Q3 | 158 |
| 95-th percentile | 1218 |
| Maximum | 9999 |
| Range | 9998 |
| Interquartile range (IQR) | 91 |
Descriptive statistics
| Standard deviation | 1239.064331 |
|---|---|
| Coefficient of variation (CV) | 3.057503555 |
| Kurtosis | 24.39880847 |
| Mean | 405.2536026 |
| Median Absolute Deviation (MAD) | 49 |
| Skewness | 4.964466117 |
| Sum | 218483159 |
| Variance | 1535280.415 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 102 | 91596 | |
| 99 | 62200 | 11.5% |
| 296 | 52333 | 9.7% |
| 41 | 28757 | 5.3% |
| 11 | 21741 | 4.0% |
| 16 | 14181 | 2.6% |
| 53 | 13462 | 2.5% |
| 81 | 12029 | 2.2% |
| 27 | 10637 | 2.0% |
| 94 | 10573 | 2.0% |
| Other values (271) | 221618 |
| Value | Count | Frequency (%) |
| 1 | 572 | 0.1% |
| 2 | 512 | 0.1% |
| 3 | 49 | < 0.1% |
| 4 | 85 | < 0.1% |
| 5 | 144 | < 0.1% |
| 7 | 26 | < 0.1% |
| 8 | 66 | < 0.1% |
| 9 | 29 | < 0.1% |
| 10 | 59 | < 0.1% |
| 11 | 21741 |
| Value | Count | Frequency (%) |
| 9999 | 2 | < 0.1% |
| 9998 | 24 | < 0.1% |
| 9997 | 6 | < 0.1% |
| 9996 | 10 | < 0.1% |
| 9995 | 3 | < 0.1% |
| 9994 | 20 | < 0.1% |
| 9993 | 3 | < 0.1% |
| 9992 | 2 | < 0.1% |
| 9907 | 3 | < 0.1% |
| 9906 | 68 |
| Distinct | 55 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.1 MiB |
| Texas | 35657 |
|---|---|
| Minnesota | 26281 |
| Nebraska | 26132 |
| Iowa | 25414 |
| Kansas | 22637 |
| Other values (50) |
Length
| Max length | 26 |
|---|---|
| Median length | 14 |
| Mean length | 8.088205562 |
| Min length | 4 |
Characters and Unicode
| Total characters | 4360570 |
|---|---|
| Distinct characters | 47 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Alabama |
|---|---|
| 2nd row | Alabama |
| 3rd row | Alabama |
| 4th row | Alabama |
| 5th row | Alabama |
Common Values
| Value | Count | Frequency (%) |
| Texas | 35657 | 6.6% |
| Minnesota | 26281 | 4.9% |
| Nebraska | 26132 | 4.8% |
| Iowa | 25414 | 4.7% |
| Kansas | 22637 | 4.2% |
| Wisconsin | 22074 | 4.1% |
| Illinois | 19790 | 3.7% |
| Ohio | 18846 | 3.5% |
| Michigan | 18214 | 3.4% |
| Missouri | 18123 | 3.4% |
| Other values (45) | 305959 |
Length
| Value | Count | Frequency (%) |
| texas | 35657 | 5.7% |
| dakota | 31265 | 5.0% |
| north | 30962 | 4.9% |
| minnesota | 26281 | 4.2% |
| nebraska | 26132 | 4.2% |
| iowa | 25414 | 4.0% |
| south | 25139 | 4.0% |
| carolina | 24836 | 4.0% |
| new | 24641 | 3.9% |
| kansas | 22637 | 3.6% |
| Other values (54) | 354890 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 604115 | |
| i | 409135 | 9.4% |
| n | 402282 | 9.2% |
| o | 398700 | 9.1% |
| s | 345588 | 7.9% |
| e | 236518 | 5.4% |
| r | 206628 | 4.7% |
| t | 172292 | 4.0% |
| h | 134213 | 3.1% |
| l | 134211 | 3.1% |
| Other values (37) | 1316888 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3643658 | |
| Uppercase Letter | 627523 | 14.4% |
| Space Separator | 88727 | 2.0% |
| Other Punctuation | 662 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 604115 | |
| i | 409135 | |
| n | 402282 | |
| o | 398700 | |
| s | 345588 | |
| e | 236518 | 6.5% |
| r | 206628 | 5.7% |
| t | 172292 | 4.7% |
| h | 134213 | 3.7% |
| l | 134211 | 3.7% |
| Other values (14) | 599976 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 100864 | |
| N | 82846 | |
| I | 74945 | |
| C | 46281 | |
| T | 44765 | |
| O | 42356 | |
| W | 36610 | 5.8% |
| K | 35711 | 5.7% |
| D | 32321 | 5.2% |
| S | 25485 | 4.1% |
| Other values (11) | 105339 |
Space Separator
| Value | Count | Frequency (%) |
| 88727 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 662 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4271181 | |
| Common | 89389 | 2.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 604115 | |
| i | 409135 | 9.6% |
| n | 402282 | 9.4% |
| o | 398700 | 9.3% |
| s | 345588 | 8.1% |
| e | 236518 | 5.5% |
| r | 206628 | 4.8% |
| t | 172292 | 4.0% |
| h | 134213 | 3.1% |
| l | 134211 | 3.1% |
| Other values (35) | 1227499 |
Common
| Value | Count | Frequency (%) |
| 88727 | ||
| . | 662 | 0.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4360570 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 604115 | |
| i | 409135 | 9.4% |
| n | 402282 | 9.2% |
| o | 398700 | 9.1% |
| s | 345588 | 7.9% |
| e | 236518 | 5.4% |
| r | 206628 | 4.7% |
| t | 172292 | 4.0% |
| h | 134213 | 3.1% |
| l | 134211 | 3.1% |
| Other values (37) | 1316888 |
| Distinct | 1808 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.1 MiB |
| Washington | 5964 |
|---|---|
| Franklin | 4630 |
| Jackson | 4389 |
| Jefferson | 4262 |
| Lincoln | 3798 |
| Other values (1803) |
Length
| Max length | 44 |
|---|---|
| Median length | 33 |
| Mean length | 7.091462679 |
| Min length | 3 |
Characters and Unicode
| Total characters | 3823199 |
|---|---|
| Distinct characters | 59 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Autauga |
|---|---|
| 2nd row | Autauga |
| 3rd row | Autauga |
| 4th row | Autauga |
| 5th row | Autauga |
Common Values
| Value | Count | Frequency (%) |
| Washington | 5964 | 1.1% |
| Franklin | 4630 | 0.9% |
| Jackson | 4389 | 0.8% |
| Jefferson | 4262 | 0.8% |
| Lincoln | 3798 | 0.7% |
| Madison | 3243 | 0.6% |
| Adams | 3057 | 0.6% |
| Marion | 2909 | 0.5% |
| Monroe | 2842 | 0.5% |
| Clay | 2813 | 0.5% |
| Other values (1798) | 501220 |
Length
| Value | Count | Frequency (%) |
| washington | 6053 | 1.0% |
| franklin | 4824 | 0.8% |
| jefferson | 4406 | 0.8% |
| jackson | 4389 | 0.8% |
| st | 3824 | 0.7% |
| lincoln | 3798 | 0.6% |
| madison | 3243 | 0.6% |
| adams | 3057 | 0.5% |
| monroe | 3013 | 0.5% |
| marion | 2909 | 0.5% |
| Other values (1823) | 545485 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 372853 | 9.8% |
| a | 372229 | 9.7% |
| n | 317793 | 8.3% |
| o | 292720 | 7.7% |
| r | 259493 | 6.8% |
| l | 213885 | 5.6% |
| i | 197175 | 5.2% |
| t | 176418 | 4.6% |
| s | 170300 | 4.5% |
| u | 100682 | 2.6% |
| Other values (49) | 1349651 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3169909 | |
| Uppercase Letter | 591341 | 15.5% |
| Space Separator | 45874 | 1.2% |
| Other Punctuation | 13957 | 0.4% |
| Decimal Number | 2118 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 372853 | |
| a | 372229 | |
| n | 317793 | |
| o | 292720 | |
| r | 259493 | 8.2% |
| l | 213885 | 6.7% |
| i | 197175 | 6.2% |
| t | 176418 | 5.6% |
| s | 170300 | 5.4% |
| u | 100682 | 3.2% |
| Other values (16) | 696361 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 68369 | 11.6% |
| M | 53966 | 9.1% |
| S | 48147 | 8.1% |
| B | 44974 | 7.6% |
| W | 41252 | 7.0% |
| L | 38958 | 6.6% |
| P | 34313 | 5.8% |
| H | 33777 | 5.7% |
| G | 26514 | 4.5% |
| D | 26152 | 4.4% |
| Other values (15) | 174919 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 6724 | |
| . | 4056 | |
| & | 1059 | 7.6% |
| # | 1059 | 7.6% |
| ; | 1059 | 7.6% |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 1059 | |
| 9 | 1059 |
Space Separator
| Value | Count | Frequency (%) |
| 45874 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3761250 | |
| Common | 61949 | 1.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 372853 | 9.9% |
| a | 372229 | 9.9% |
| n | 317793 | 8.4% |
| o | 292720 | 7.8% |
| r | 259493 | 6.9% |
| l | 213885 | 5.7% |
| i | 197175 | 5.2% |
| t | 176418 | 4.7% |
| s | 170300 | 4.5% |
| u | 100682 | 2.7% |
| Other values (41) | 1287702 |
Common
| Value | Count | Frequency (%) |
| 45874 | ||
| , | 6724 | 10.9% |
| . | 4056 | 6.5% |
| & | 1059 | 1.7% |
| # | 1059 | 1.7% |
| 3 | 1059 | 1.7% |
| 9 | 1059 | 1.7% |
| ; | 1059 | 1.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3823199 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 372853 | 9.8% |
| a | 372229 | 9.7% |
| n | 317793 | 8.3% |
| o | 292720 | 7.7% |
| r | 259493 | 6.8% |
| l | 213885 | 5.6% |
| i | 197175 | 5.2% |
| t | 176418 | 4.6% |
| s | 170300 | 4.5% |
| u | 100682 | 2.6% |
| Other values (49) | 1349651 |
State_County_Code
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 3069 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 423 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 31111.80045 |
| Minimum | 1001 |
|---|---|
| Maximum | 72141 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 MiB |
Quantile statistics
| Minimum | 1001 |
|---|---|
| 5-th percentile | 8007 |
| Q1 | 19181 |
| median | 30075 |
| Q3 | 42039 |
| 95-th percentile | 54089 |
| Maximum | 72141 |
| Range | 71140 |
| Interquartile range (IQR) | 22858 |
Descriptive statistics
| Standard deviation | 14046.55323 |
|---|---|
| Coefficient of variation (CV) | 0.4514863503 |
| Kurtosis | -0.8269182426 |
| Mean | 31111.80045 |
| Median Absolute Deviation (MAD) | 10968 |
| Skewness | -0.02568760535 |
| Sum | 1.676005135 × 1010 |
| Variance | 197305657.8 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 26021 | 931 | 0.2% |
| 27037 | 854 | 0.2% |
| 27145 | 761 | 0.1% |
| 41047 | 709 | 0.1% |
| 8123 | 706 | 0.1% |
| 36067 | 701 | 0.1% |
| 41059 | 699 | 0.1% |
| 48445 | 693 | 0.1% |
| 30013 | 656 | 0.1% |
| 55021 | 649 | 0.1% |
| Other values (3059) | 531345 |
| Value | Count | Frequency (%) |
| 1001 | 247 | |
| 1003 | 274 | |
| 1005 | 192 | |
| 1007 | 61 | < 0.1% |
| 1009 | 219 | |
| 1011 | 108 | < 0.1% |
| 1013 | 105 | < 0.1% |
| 1015 | 121 | |
| 1017 | 142 | |
| 1019 | 172 |
| Value | Count | Frequency (%) |
| 72141 | 81 | |
| 72113 | 100 | |
| 72097 | 187 | |
| 72081 | 70 | < 0.1% |
| 72047 | 77 | |
| 72025 | 74 | < 0.1% |
| 72019 | 110 | |
| 72013 | 133 | |
| 72001 | 65 | < 0.1% |
| 69110 | 32 | < 0.1% |
| Distinct | 283 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.1 MiB |
| GRASS | |
|---|---|
| CRP | |
| MIXED FORAGE | |
| CORN | |
| WHEAT | 21741 |
| Other values (278) |
Length
| Max length | 34 |
|---|---|
| Median length | 27 |
| Mean length | 7.013575651 |
| Min length | 3 |
Characters and Unicode
| Total characters | 3781208 |
|---|---|
| Distinct characters | 33 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 14 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | WHEAT |
|---|---|
| 2nd row | WHEAT |
| 3rd row | OATS |
| 4th row | OATS |
| 5th row | OATS |
Common Values
| Value | Count | Frequency (%) |
| GRASS | 91596 | |
| CRP | 62200 | 11.5% |
| MIXED FORAGE | 52333 | 9.7% |
| CORN | 28757 | 5.3% |
| WHEAT | 21741 | 4.0% |
| OATS | 14181 | 2.6% |
| GRAPES | 13462 | 2.5% |
| SOYBEANS | 12029 | 2.2% |
| ALFALFA | 10637 | 2.0% |
| RYE | 10573 | 2.0% |
| Other values (273) | 221618 |
Length
| Value | Count | Frequency (%) |
| grass | 91596 | 14.0% |
| crp | 62200 | 9.5% |
| forage | 61498 | 9.4% |
| mixed | 52333 | 8.0% |
| corn | 28757 | 4.4% |
| wheat | 21741 | 3.3% |
| sorghum | 17751 | 2.7% |
| oats | 14181 | 2.2% |
| grapes | 13462 | 2.1% |
| soybeans | 12029 | 1.8% |
| Other values (315) | 281028 |
Most occurring characters
| Value | Count | Frequency (%) |
| R | 434693 | 11.5% |
| E | 404263 | 10.7% |
| S | 400081 | 10.6% |
| A | 378609 | 10.0% |
| O | 257467 | 6.8% |
| G | 208296 | 5.5% |
| P | 170053 | 4.5% |
| C | 162592 | 4.3% |
| T | 138632 | 3.7% |
| L | 135522 | 3.6% |
| Other values (23) | 1091000 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 3647301 | |
| Space Separator | 131754 | 3.5% |
| Other Punctuation | 954 | < 0.1% |
| Open Punctuation | 584 | < 0.1% |
| Close Punctuation | 584 | < 0.1% |
| Decimal Number | 31 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 434693 | |
| E | 404263 | |
| S | 400081 | |
| A | 378609 | 10.4% |
| O | 257467 | 7.1% |
| G | 208296 | 5.7% |
| P | 170053 | 4.7% |
| C | 162592 | 4.5% |
| T | 138632 | 3.8% |
| L | 135522 | 3.7% |
| Other values (16) | 957093 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 11 | |
| 3 | 10 | |
| 1 | 10 |
Space Separator
| Value | Count | Frequency (%) |
| 131754 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 954 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 584 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 584 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3647301 | |
| Common | 133907 | 3.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| R | 434693 | |
| E | 404263 | |
| S | 400081 | |
| A | 378609 | 10.4% |
| O | 257467 | 7.1% |
| G | 208296 | 5.7% |
| P | 170053 | 4.7% |
| C | 162592 | 4.5% |
| T | 138632 | 3.8% |
| L | 135522 | 3.7% |
| Other values (16) | 957093 |
Common
| Value | Count | Frequency (%) |
| 131754 | ||
| / | 954 | 0.7% |
| ( | 584 | 0.4% |
| ) | 584 | 0.4% |
| 0 | 11 | < 0.1% |
| 3 | 10 | < 0.1% |
| 1 | 10 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3781208 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| R | 434693 | 11.5% |
| E | 404263 | 10.7% |
| S | 400081 | 10.6% |
| A | 378609 | 10.0% |
| O | 257467 | 6.8% |
| G | 208296 | 5.5% |
| P | 170053 | 4.5% |
| C | 162592 | 4.3% |
| T | 138632 | 3.7% |
| L | 135522 | 3.6% |
| Other values (23) | 1091000 |
| Distinct | 29 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 25340 |
| Missing (%) | 4.7% |
| Memory size | 4.1 MiB |
| Forage | |
|---|---|
| Fresh | |
| Blank | |
| Grazing | |
| Grain | |
| Other values (24) |
Length
| Max length | 14 |
|---|---|
| Median length | 13 |
| Mean length | 6.227061019 |
| Min length | 3 |
Characters and Unicode
| Total characters | 3199383 |
|---|---|
| Distinct characters | 38 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Grain |
|---|---|
| 2nd row | Seed |
| 3rd row | Grazing |
| 4th row | Grain |
| 5th row | Grazing |
Common Values
| Value | Count | Frequency (%) |
| Forage | 107064 | |
| Fresh | 95976 | |
| Blank | 81833 | |
| Grazing | 69946 | |
| Grain | 61445 | |
| Left Standing | 28253 | 5.2% |
| Processed | 19405 | 3.6% |
| Seed | 18155 | 3.4% |
| Cover Only | 8258 | 1.5% |
| Silage | 5589 | 1.0% |
| Other values (19) | 17863 | 3.3% |
| (Missing) | 25340 | 4.7% |
Length
| Value | Count | Frequency (%) |
| forage | 107064 | |
| fresh | 95976 | |
| blank | 81833 | |
| grazing | 69946 | |
| grain | 61445 | |
| left | 28253 | 5.0% |
| standing | 28253 | 5.0% |
| processed | 19405 | 3.5% |
| seed | 18155 | 3.2% |
| cover | 8258 | 1.5% |
| Other values (23) | 41272 | 7.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 373621 | |
| a | 362005 | |
| e | 338022 | |
| n | 285494 | 8.9% |
| g | 210876 | 6.6% |
| F | 203238 | 6.4% |
| i | 171920 | 5.4% |
| o | 138854 | 4.3% |
| s | 137461 | 4.3% |
| G | 135211 | 4.2% |
| Other values (28) | 842681 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2589690 | |
| Uppercase Letter | 562916 | 17.6% |
| Space Separator | 46073 | 1.4% |
| Other Punctuation | 704 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 373621 | |
| a | 362005 | |
| e | 338022 | |
| n | 285494 | |
| g | 210876 | |
| i | 171920 | |
| o | 138854 | 5.4% |
| s | 137461 | 5.3% |
| l | 103688 | 4.0% |
| h | 95976 | 3.7% |
| Other values (11) | 371773 |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 203238 | |
| G | 135211 | |
| B | 83009 | |
| S | 55268 | 9.8% |
| L | 28273 | 5.0% |
| P | 21457 | 3.8% |
| C | 9434 | 1.7% |
| O | 8665 | 1.5% |
| D | 6514 | 1.2% |
| E | 5377 | 1.0% |
| Other values (5) | 6470 | 1.1% |
Space Separator
| Value | Count | Frequency (%) |
| 46073 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 704 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3152606 | |
| Common | 46777 | 1.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 373621 | |
| a | 362005 | |
| e | 338022 | |
| n | 285494 | 9.1% |
| g | 210876 | 6.7% |
| F | 203238 | 6.4% |
| i | 171920 | 5.5% |
| o | 138854 | 4.4% |
| s | 137461 | 4.4% |
| G | 135211 | 4.3% |
| Other values (26) | 795904 |
Common
| Value | Count | Frequency (%) |
| 46073 | ||
| / | 704 | 1.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3199383 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 373621 | |
| a | 362005 | |
| e | 338022 | |
| n | 285494 | 8.9% |
| g | 210876 | 6.6% |
| F | 203238 | 6.4% |
| i | 171920 | 5.4% |
| o | 138854 | 4.3% |
| s | 137461 | 4.3% |
| G | 135211 | 4.2% |
| Other values (28) | 842681 |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.1 MiB |
| N | |
|---|---|
| I | |
| O | 3316 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 539127 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | N |
|---|---|
| 2nd row | N |
| 3rd row | N |
| 4th row | N |
| 5th row | N |
Common Values
| Value | Count | Frequency (%) |
| N | 406616 | |
| I | 129195 | 24.0% |
| O | 3316 | 0.6% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| n | 406616 | |
| i | 129195 | 24.0% |
| o | 3316 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 406616 | |
| I | 129195 | 24.0% |
| O | 3316 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 539127 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 406616 | |
| I | 129195 | 24.0% |
| O | 3316 | 0.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 539127 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 406616 | |
| I | 129195 | 24.0% |
| O | 3316 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 539127 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 406616 | |
| I | 129195 | 24.0% |
| O | 3316 | 0.6% |
Planted_Acres
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONSKEWEDZEROS| Distinct | 150912 |
|---|---|
| Distinct (%) | 28.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3775.961195 |
| Minimum | 0 |
|---|---|
| Maximum | 6914872.27 |
| Zeros | 17658 |
| Zeros (%) | 3.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.065 |
| Q1 | 6.01 |
| median | 51.2 |
| Q3 | 365.42 |
| 95-th percentile | 7972.457 |
| Maximum | 6914872.27 |
| Range | 6914872.27 |
| Interquartile range (IQR) | 359.41 |
Descriptive statistics
| Standard deviation | 45591.58925 |
|---|---|
| Coefficient of variation (CV) | 12.07416785 |
| Kurtosis | 4855.667575 |
| Mean | 3775.961195 |
| Median Absolute Deviation (MAD) | 50.7 |
| Skewness | 54.10435986 |
| Sum | 2035722631 |
| Variance | 2078593011 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 17658 | 3.3% |
| 1 | 5398 | 1.0% |
| 0.5 | 3883 | 0.7% |
| 2 | 3873 | 0.7% |
| 0.1 | 2910 | 0.5% |
| 3 | 2432 | 0.5% |
| 5 | 2314 | 0.4% |
| 0.25 | 2091 | 0.4% |
| 4 | 1951 | 0.4% |
| 1.5 | 1931 | 0.4% |
| Other values (150902) | 494686 |
| Value | Count | Frequency (%) |
| 0 | 17658 | |
| 0.0001 | 48 | < 0.1% |
| 0.0002 | 16 | < 0.1% |
| 0.0003 | 10 | < 0.1% |
| 0.0004 | 5 | < 0.1% |
| 0.0005 | 7 | < 0.1% |
| 0.0006 | 24 | < 0.1% |
| 0.0007 | 11 | < 0.1% |
| 0.0008 | 8 | < 0.1% |
| 0.0009 | 9 | < 0.1% |
| Value | Count | Frequency (%) |
| 6914872.27 | 1 | |
| 6128962.12 | 1 | |
| 5578955.55 | 1 | |
| 5533689.1 | 1 | |
| 5461513.27 | 1 | |
| 5424527.57 | 1 | |
| 4754721.09 | 1 | |
| 4315211.08 | 1 | |
| 4230149.13 | 1 | |
| 4226050.6 | 1 |
| Distinct | 14413 |
|---|---|
| Distinct (%) | 2.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 122.2206267 |
| Minimum | 0 |
|---|---|
| Maximum | 3205029.9 |
| Zeros | 510568 |
| Zeros (%) | 94.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1.38 |
| Maximum | 3205029.9 |
| Range | 3205029.9 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 10725.21554 |
|---|---|
| Coefficient of variation (CV) | 87.75290912 |
| Kurtosis | 52098.19125 |
| Mean | 122.2206267 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 210.2360612 |
| Sum | 65892439.79 |
| Variance | 115030248.5 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 510568 | |
| 1 | 124 | < 0.1% |
| 2 | 124 | < 0.1% |
| 3 | 99 | < 0.1% |
| 5 | 91 | < 0.1% |
| 4 | 88 | < 0.1% |
| 10 | 82 | < 0.1% |
| 0.5 | 73 | < 0.1% |
| 1.5 | 54 | < 0.1% |
| 0.6 | 42 | < 0.1% |
| Other values (14403) | 27782 | 5.2% |
| Value | Count | Frequency (%) |
| 0 | 510568 | |
| 0.001 | 1 | < 0.1% |
| 0.002 | 2 | < 0.1% |
| 0.01 | 1 | < 0.1% |
| 0.03 | 5 | < 0.1% |
| 0.0399 | 1 | < 0.1% |
| 0.04 | 6 | < 0.1% |
| 0.046 | 1 | < 0.1% |
| 0.05 | 5 | < 0.1% |
| 0.06 | 11 | < 0.1% |
| Value | Count | Frequency (%) |
| 3205029.9 | 1 | |
| 2943698.03 | 1 | |
| 2804325.5 | 1 | |
| 2774579.4 | 1 | |
| 2576408.947 | 1 | |
| 1661772.11 | 1 | |
| 1618524.667 | 1 | |
| 1194009.02 | 1 | |
| 1119797.44 | 1 | |
| 1114766.57 | 1 |
| Distinct | 6835 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.63542055 |
| Minimum | 0 |
|---|---|
| Maximum | 245694.68 |
| Zeros | 529907 |
| Zeros (%) | 98.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 245694.68 |
| Range | 245694.68 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 908.3418301 |
|---|---|
| Coefficient of variation (CV) | 58.0951326 |
| Kurtosis | 25499.62461 |
| Mean | 15.63542055 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 138.0075859 |
| Sum | 8429477.374 |
| Variance | 825084.8803 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 529907 | |
| 10 | 51 | < 0.1% |
| 2 | 50 | < 0.1% |
| 5 | 45 | < 0.1% |
| 1 | 45 | < 0.1% |
| 15 | 43 | < 0.1% |
| 30 | 39 | < 0.1% |
| 4 | 37 | < 0.1% |
| 3 | 37 | < 0.1% |
| 20 | 35 | < 0.1% |
| Other values (6825) | 8838 | 1.6% |
| Value | Count | Frequency (%) |
| 0 | 529907 | |
| 0.0004 | 1 | < 0.1% |
| 0.001 | 10 | < 0.1% |
| 0.002 | 2 | < 0.1% |
| 0.003 | 1 | < 0.1% |
| 0.0046 | 1 | < 0.1% |
| 0.005 | 10 | < 0.1% |
| 0.0057 | 1 | < 0.1% |
| 0.01 | 5 | < 0.1% |
| 0.0101 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 245694.68 | 1 | |
| 228193.03 | 1 | |
| 135270.325 | 1 | |
| 131140.46 | 1 | |
| 127969.01 | 1 | |
| 123209.77 | 1 | |
| 122006.88 | 1 | |
| 121225.81 | 1 | |
| 118441.24 | 1 | |
| 115055.02 | 1 |
| Distinct | 14217 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 60.16144427 |
| Minimum | 0 |
|---|---|
| Maximum | 259649.49 |
| Zeros | 521451 |
| Zeros (%) | 96.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 259649.49 |
| Range | 259649.49 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1377.104224 |
|---|---|
| Coefficient of variation (CV) | 22.89014569 |
| Kurtosis | 8367.386719 |
| Mean | 60.16144427 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 72.64528079 |
| Sum | 32434658.96 |
| Variance | 1896416.044 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 521451 | |
| 5 | 86 | < 0.1% |
| 2 | 73 | < 0.1% |
| 1 | 69 | < 0.1% |
| 10 | 66 | < 0.1% |
| 20 | 60 | < 0.1% |
| 3 | 59 | < 0.1% |
| 4 | 58 | < 0.1% |
| 0.5 | 47 | < 0.1% |
| 6 | 41 | < 0.1% |
| Other values (14207) | 17117 | 3.2% |
| Value | Count | Frequency (%) |
| 0 | 521451 | |
| 0.0011 | 1 | < 0.1% |
| 0.0057 | 2 | < 0.1% |
| 0.01 | 8 | < 0.1% |
| 0.02 | 2 | < 0.1% |
| 0.03 | 2 | < 0.1% |
| 0.04 | 5 | < 0.1% |
| 0.06 | 2 | < 0.1% |
| 0.075 | 1 | < 0.1% |
| 0.1 | 13 | < 0.1% |
| Value | Count | Frequency (%) |
| 259649.49 | 1 | |
| 221519.63 | 1 | |
| 208771.005 | 1 | |
| 186101.96 | 1 | |
| 168317.73 | 1 | |
| 167046.4 | 1 | |
| 166986.51 | 1 | |
| 154883.75 | 1 | |
| 128749.43 | 1 | |
| 128401.03 | 1 |
| Distinct | 10278 |
|---|---|
| Distinct (%) | 1.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 47.3765942 |
| Minimum | 0 |
|---|---|
| Maximum | 435738.97 |
| Zeros | 522162 |
| Zeros (%) | 96.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 435738.97 |
| Range | 435738.97 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 2008.62201 |
|---|---|
| Coefficient of variation (CV) | 42.3969271 |
| Kurtosis | 22337.2707 |
| Mean | 47.3765942 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 131.4743886 |
| Sum | 25542001.1 |
| Variance | 4034562.381 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 522162 | |
| 2 | 122 | < 0.1% |
| 1 | 114 | < 0.1% |
| 4 | 106 | < 0.1% |
| 3 | 89 | < 0.1% |
| 6 | 80 | < 0.1% |
| 5 | 80 | < 0.1% |
| 10 | 60 | < 0.1% |
| 8 | 57 | < 0.1% |
| 7 | 54 | < 0.1% |
| Other values (10268) | 16203 | 3.0% |
| Value | Count | Frequency (%) |
| 0 | 522162 | |
| 0.0025 | 1 | < 0.1% |
| 0.004 | 1 | < 0.1% |
| 0.01 | 3 | < 0.1% |
| 0.012 | 1 | < 0.1% |
| 0.02 | 1 | < 0.1% |
| 0.03 | 3 | < 0.1% |
| 0.04 | 4 | < 0.1% |
| 0.05 | 4 | < 0.1% |
| 0.06 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 435738.97 | 1 | |
| 435141.36 | 1 | |
| 429022.69 | 1 | |
| 386085.44 | 1 | |
| 380418.93 | 1 | |
| 359695.39 | 1 | |
| 223938.88 | 1 | |
| 222664.42 | 1 | |
| 216260.31 | 1 | |
| 211978.48 | 1 |
Planted_and_Failed_Acres
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONSKEWEDZEROS| Distinct | 151418 |
|---|---|
| Distinct (%) | 28.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3791.596615 |
| Minimum | 0 |
|---|---|
| Maximum | 6914872.27 |
| Zeros | 17116 |
| Zeros (%) | 3.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.07 |
| Q1 | 6.08 |
| median | 51.45 |
| Q3 | 366.375 |
| 95-th percentile | 8009.723 |
| Maximum | 6914872.27 |
| Range | 6914872.27 |
| Interquartile range (IQR) | 360.295 |
Descriptive statistics
| Standard deviation | 45619.42721 |
|---|---|
| Coefficient of variation (CV) | 12.03171957 |
| Kurtosis | 4843.861243 |
| Mean | 3791.596615 |
| Median Absolute Deviation (MAD) | 50.95 |
| Skewness | 54.01257506 |
| Sum | 2044152108 |
| Variance | 2081132139 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 17116 | 3.2% |
| 1 | 5405 | 1.0% |
| 0.5 | 3884 | 0.7% |
| 2 | 3869 | 0.7% |
| 0.1 | 2923 | 0.5% |
| 3 | 2437 | 0.5% |
| 5 | 2318 | 0.4% |
| 0.25 | 2091 | 0.4% |
| 4 | 1955 | 0.4% |
| 1.5 | 1933 | 0.4% |
| Other values (151408) | 495196 |
| Value | Count | Frequency (%) |
| 0 | 17116 | |
| 0.0001 | 48 | < 0.1% |
| 0.0002 | 16 | < 0.1% |
| 0.0003 | 10 | < 0.1% |
| 0.0004 | 6 | < 0.1% |
| 0.0005 | 7 | < 0.1% |
| 0.0006 | 24 | < 0.1% |
| 0.0007 | 11 | < 0.1% |
| 0.0008 | 8 | < 0.1% |
| 0.0009 | 9 | < 0.1% |
| Value | Count | Frequency (%) |
| 6914872.27 | 1 | |
| 6128962.12 | 1 | |
| 5578955.55 | 1 | |
| 5533689.1 | 1 | |
| 5461513.27 | 1 | |
| 5424527.57 | 1 | |
| 4754721.09 | 1 | |
| 4315211.08 | 1 | |
| 4230149.13 | 1 | |
| 4226050.6 | 1 |
Crop_Year
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.1 MiB |
| 2020 | |
|---|---|
| 2019 | |
| 2018 |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 2156508 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2018 |
|---|---|
| 2nd row | 2018 |
| 3rd row | 2018 |
| 4th row | 2018 |
| 5th row | 2018 |
Common Values
| Value | Count | Frequency (%) |
| 2020 | 185797 | |
| 2019 | 185028 | |
| 2018 | 168302 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 2020 | 185797 | |
| 2019 | 185028 | |
| 2018 | 168302 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 724924 | |
| 0 | 724924 | |
| 1 | 353330 | |
| 9 | 185028 | 8.6% |
| 8 | 168302 | 7.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2156508 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 724924 | |
| 0 | 724924 | |
| 1 | 353330 | |
| 9 | 185028 | 8.6% |
| 8 | 168302 | 7.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2156508 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 724924 | |
| 0 | 724924 | |
| 1 | 353330 | |
| 9 | 185028 | 8.6% |
| 8 | 168302 | 7.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2156508 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 724924 | |
| 0 | 724924 | |
| 1 | 353330 | |
| 9 | 185028 | 8.6% |
| 8 | 168302 | 7.8% |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| State_Code | County_Code | Crop_Code | State | County | State_County_Code | Crop | Crop_Type | Intended_Use | Irrigation_Practice | Planted_Acres | Volunteer_Acres | Failed_Acres | Prevented_Acres | Not_Planted_Acres | Planted_and_Failed_Acres | Crop_Year | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 1 | 11 | Alabama | Autauga | 1001.0 | WHEAT | SOFT RED WINTER | Grain | N | 161.00 | 0.0 | 0.0 | 0.0 | 0.0 | 161.00 | 2018 |
| 1 | 1 | 1 | 11 | Alabama | Autauga | 1001.0 | WHEAT | SOFT RED WINTER | Seed | N | 26.78 | 0.0 | 0.0 | 0.0 | 0.0 | 26.78 | 2018 |
| 2 | 1 | 1 | 16 | Alabama | Autauga | 1001.0 | OATS | HULLESS WINTER | Grazing | N | 47.25 | 0.0 | 0.0 | 0.0 | 0.0 | 47.25 | 2018 |
| 3 | 1 | 1 | 16 | Alabama | Autauga | 1001.0 | OATS | WINTER | Grain | N | 166.19 | 0.0 | 0.0 | 0.0 | 0.0 | 166.19 | 2018 |
| 4 | 1 | 1 | 16 | Alabama | Autauga | 1001.0 | OATS | WINTER | Grazing | N | 3.37 | 0.0 | 0.0 | 0.0 | 0.0 | 3.37 | 2018 |
| 5 | 1 | 1 | 16 | Alabama | Autauga | 1001.0 | OATS | WINTER | Seed | N | 22.32 | 0.0 | 0.0 | 0.0 | 0.0 | 22.32 | 2018 |
| 6 | 1 | 1 | 21 | Alabama | Autauga | 1001.0 | COTTON UPLAND | NaN | NaN | I | 794.80 | 0.0 | 0.0 | 0.0 | 0.0 | 794.80 | 2018 |
| 7 | 1 | 1 | 21 | Alabama | Autauga | 1001.0 | COTTON UPLAND | NaN | NaN | N | 8359.27 | 0.0 | 0.0 | 0.0 | 0.0 | 8359.27 | 2018 |
| 8 | 1 | 1 | 34 | Alabama | Autauga | 1001.0 | PEACHES | CLING PEACHES | Fresh | N | 4.00 | 0.0 | 0.0 | 0.0 | 0.0 | 4.00 | 2018 |
| 9 | 1 | 1 | 34 | Alabama | Autauga | 1001.0 | PEACHES | FREESTONE LATE SEASON | Fresh | N | 5.00 | 0.0 | 0.0 | 0.0 | 0.0 | 5.00 | 2018 |
Last rows
| State_Code | County_Code | Crop_Code | State | County | State_County_Code | Crop | Crop_Type | Intended_Use | Irrigation_Practice | Planted_Acres | Volunteer_Acres | Failed_Acres | Prevented_Acres | Not_Planted_Acres | Planted_and_Failed_Acres | Crop_Year | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 539117 | 72 | 141 | 1010 | Puerto Rico | Utuado | 72141.0 | NURSERY | CONTAINER | Blank | I | 1.9800 | 0.0 | 0.0 | 0.0 | 0.00 | 1.9800 | 2020 |
| 539118 | 72 | 141 | 1010 | Puerto Rico | Utuado | 72141.0 | NURSERY | EDIBLE CONTAINER | Blank | I | 0.0850 | 0.0 | 0.0 | 0.0 | 0.00 | 0.0850 | 2020 |
| 539119 | 72 | 141 | 1166 | Puerto Rico | Utuado | 72141.0 | CAIMITO | NaN | Fresh | N | 0.9700 | 0.0 | 0.0 | 0.0 | 0.00 | 0.9700 | 2020 |
| 539120 | 72 | 141 | 1167 | Puerto Rico | Utuado | 72141.0 | GUAMABANA/SOURSOP | NaN | Fresh | N | 0.4856 | 0.0 | 0.0 | 0.0 | 0.00 | 0.4856 | 2020 |
| 539121 | 72 | 141 | 1190 | Puerto Rico | Utuado | 72141.0 | HONEY | NaN | Fresh | O | 0.0000 | 0.0 | 0.0 | 0.0 | 0.25 | 0.0000 | 2020 |
| 539122 | 72 | 141 | 1290 | Puerto Rico | Utuado | 72141.0 | BREADFRUIT | NaN | Fresh | N | 54.8800 | 0.0 | 0.0 | 0.0 | 0.00 | 54.8800 | 2020 |
| 539123 | 72 | 141 | 7037 | Puerto Rico | Utuado | 72141.0 | JACK FRUIT | NaN | Fresh | N | 1.9400 | 0.0 | 0.0 | 0.0 | 0.00 | 1.9400 | 2020 |
| 539124 | 72 | 141 | 7164 | Puerto Rico | Utuado | 72141.0 | RAMBUTAN | NaN | Fresh | N | 10.6724 | 0.0 | 0.0 | 0.0 | 0.00 | 10.6724 | 2020 |
| 539125 | 72 | 141 | 7208 | Puerto Rico | Utuado | 72141.0 | MANGOSTEEN | NaN | Fresh | N | 8.7300 | 0.0 | 0.0 | 0.0 | 0.00 | 8.7300 | 2020 |
| 539126 | 72 | 141 | 8005 | Puerto Rico | Utuado | 72141.0 | LYCHEE | NaN | Fresh | N | 0.9700 | 0.0 | 0.0 | 0.0 | 0.00 | 0.9700 | 2020 |
Most frequently occurring
| State_Code | County_Code | Crop_Code | State | County | State_County_Code | Crop | Intended_Use | Irrigation_Practice | Planted_Acres | Volunteer_Acres | Failed_Acres | Prevented_Acres | Not_Planted_Acres | Planted_and_Failed_Acres | Crop_Year | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2244 | 37 | 7 | 7501 | North Carolina | Anson | 37007.0 | FLOWERS | Fresh | I | 0.0100 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0100 | 2020 | 40 |
| 2553 | 39 | 27 | 7501 | Ohio | Clinton | 39027.0 | FLOWERS | Fresh | N | 0.0100 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0100 | 2018 | 17 |
| 624 | 18 | 181 | 5000 | Indiana | White | 18181.0 | HERBS | Fresh | N | 0.0023 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0023 | 2020 | 16 |
| 2632 | 39 | 63 | 7501 | Ohio | Hancock | 39063.0 | FLOWERS | Fresh | I | 0.0020 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0020 | 2018 | 15 |
| 2506 | 38 | 31 | 53 | North Dakota | Foster | 38031.0 | GRAPES | Fresh | N | 0.0174 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0174 | 2018 | 13 |
| 1952 | 35 | 25 | 53 | New Mexico | Lea | 35025.0 | GRAPES | Fresh | I | 0.0900 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0900 | 2018 | 12 |
| 1953 | 35 | 25 | 53 | New Mexico | Lea | 35025.0 | GRAPES | Fresh | I | 0.0900 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0900 | 2019 | 12 |
| 1954 | 35 | 25 | 53 | New Mexico | Lea | 35025.0 | GRAPES | Fresh | I | 0.0900 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0900 | 2020 | 12 |
| 2504 | 38 | 31 | 53 | North Dakota | Foster | 38031.0 | GRAPES | Fresh | N | 0.0130 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0130 | 2019 | 12 |
| 2505 | 38 | 31 | 53 | North Dakota | Foster | 38031.0 | GRAPES | Fresh | N | 0.0130 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0130 | 2020 | 12 |